Traffic flow and efficient routing on scale-free networks: A survey 
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Recently, motivated by the pioneer works in revealing the small- world effect and scale-free property 
of various real-life networks, many scientists devote themselves to studying complex networks. In 
this paper, we give a brief review on the studies of traffic flow and efficient routing on scale-free 
networks, including the traffic dynamics based on global routing protocol, Traffic dynamics based 
on local routing protocol, and the critical phenomena and scaling behaviors of real and artificial 
traffic. Finally, perspectives and some interesting problems are proposed. 
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I. INTRODUCTION 



Many social, biological, and communication systems 
can be properly described as complex networks with ver- 
tices representing individuals or organizations and links 
mimicking the interactions among them Q, 0, 0, H] • One 
of the ultimate goals of the current studies on topologi- 
cal structures of networks is to understand and explain 
the workings of systems built upon those networks, for 
instance, to understand how the topology of the World 
Wide Web affects Web surfing and search engines, how 
the structure of social networks affects the spread of dis- 
eases, information, rumors or other things, how the struc- 
ture of a food web affects population dynamics, and so 
on. The increasing importance of large communication 
networks such as the Internet [5| , upon which our society 
survives, calls for the need for high efficiency in handling 
and delivering information. Therefore, to understand the 
traffic dynamics and find the optimal strategies for traf- 
fic routing is one of the important issues we have to ad- 
dress. There have been many previous studies to under- 
stand and control traffic congestion on networks, with a 
basic assumption that the network has a homogeneous 
structure [E H> El EiJ ■ However, many real-life com- 
munication networks, such as the Internet |llj] and the 
World- Wide- Web [T3I , display scale- free degree distribu- 
tion 0,0], thus it is of great interest to study the traffic 
flow on scale-free networks. In this light, the traffic dy- 
namics on complex networks have recently attracted a 
large amount of interest from the physical community. 

In this paper, we will give a brief review about traf- 
fic dynamics on scale-free networks. This paper is orga- 
nized as follow: In Sec. 2 and 3, the traffic dynamics 
with global and local routing protocols are introduced, 
respectively. In Sec. 4, the critical phenomena and self- 
similarity scaling of real traffic and artificial models are 
discussed. Finally, we sum up this paper in Sec. 5. 



II. TRAFFIC DYNAMICS BASED ON GLOBAL 
ROUTING PROTOCOL 

In this section, we discuss the case where the whole 
topological information is available for each router. For 
simplicity, all the nodes are treated as both hosts and 
routers. The simplest model can be described as follows: 

(1) At each time step, there are R packets generated in 
the system, with randomly chosen sources and destina- 
tions. Once a packet is created, it is placed at the end of 
the queue if this node already has several packets waiting 
to be delivered to their destinations. 

(2) At each time step, each node, i, can deliver at most 
Ci packets one step toward their destinations according 
to the routing strategy. 

(3) A packet, once reaching its destination, is removed 
from the system. 

We are most interested in the critical value R c where a 
phase transition takes place from free flow to congested 
traffic. This critical value can best reflect the maximum 
capability of a system handling its traffic. In particular, 
for R < R c , the numbers of created and delivered pack- 
ets are balanced, leading to a steady free traffic flow. For 
R > R c , traffic congestion occurs as the number of ac- 
cumulated packets increases with time, simply for that 
the capacities of nodes for delivering packets are limited. 
To characterize the phase transition, we use the following 
order parameter 
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where AW = W(t + At) - W(t), with (• • • } indicating 
average over time windows of width At, and W(t) is the 
total number of packets in the network at time t. Clearly, 
H equals zero in the free-flow state, and becomes positive 
when R exceeds R c . 

Since in the Internet, the deviation of traffic from the 
shortest path is only about 10% 0], one can assume 
that the routing process takes place according to the cri- 
terion of the shortest available path from a given source 
to its destination. Accordingly, firstly, we investigate 
the shortest-path routing strategy, which can be imple- 
mented by either of the two ways, finding the shortest- 
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FIG. 1: The order parameter H versus R for two-dimensional 
lattice (a) and BA networks (b) with the same size N = 1225 
and average degree (k) — 4. The delivering capability of 
each node is simply set as a constant C = 1. The shortest- 
path routing strategy yields Rf ttice w 60 and i?^ A ~ 4.0, 
respectively. 



path dynamically or following the fixed routing table. In 
the former case for each newly generated packet, 
the router will find a shortest path between its source 
and destination, and then, the packet is forwarded along 
this path during the following time steps. In the latter 
case [ljj , for any pair of source and destination, one of all 
the shortest paths between them is randomly chosen and 
put into the fixed routing table that is followed by all the 
information packets. Compared with the dynamical rout- 
ing algorithm and the information feed-back mechanism, 
the fixed routing algorithm is much more widely used in 
real communication systems for its obvious advantages 
in economical and technical costs EH. Actually, the 
behaviors of those two cases are pretty much the same 
[l^.[r^| . thus we will not distinguish them hereinafter. 

If the delivering capability of each node is the same, 
the critical point R c of highly heterogeneous network will 
be much smaller than that of homogeneous network. It 
is because that when all the packets follow their shortest 
paths, it will easily lead to the overload of the heavily- 
linked router, and the congestion will immediately spread 
over all the nodes. Fig. 1 shows the order parameter H 
versus R for (a) the two-dimensional lattice with periodi- 
cal boundary condition and (b) the Barabasi- Albert (BA) 
network 0,0] with average degree (fc) = 4. Clearly, the 
throughput, measured by the R c , of regular network is 
much larger than that of scale-free networks. 

To provide the theoretical estimate of the value of R c , 
we first introduce the concept of betweenness central- 
ity (see also the original concept of centrality [23, l2l| . 
the generalized concept of centralit y 123 . the physical 
meaning of betweenness centrality [23|, and some re- 
cently proposed algorithms for calculating betweenness 
[24l EH l^q). The betweenness centrality of a node v is 
defined as 



FIG. 2: (Color online) The critical R c vs /3 for BA network 
with average degree (k) = 4 and of size N = 1225 (left panel) 
and N = 1500 (right panel). Both simulation and analysis 
demonstrate that the maximal R c corresponds to /3 ~ 1. The 
results are the average over 10 independent runs. 



where o~ st is the number of shortest paths going from s to 
t and <7 s t{v) is the number of shortest paths going from s 
to t and passing through v. Below the critical value R Cl 
there is no accumulation at any node in the network and 
the number of packets that arrive at node v is, on average, 
RB V /N(N—1). Therefore, a particular node will collapse 
when RB V /N(N — 1) > C v . Considering the transferring 
capacity of each node is fixed to C and congestion occurs 
at the node with the largest betweenness centrality, R c 
can be estimated as Ufa 123 



R c = CN(N - 1)/B nax , 



(3) 



Q~st(v) 
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where B max is the largest betweenness centrality of the 
network. This equation illuminates that the network of 
larger £? max has smaller throughput. 

For many real-life networks, the betweenness centrality 
is strongly correlated with degree. In general, the larger 
the degree, the larger the centrality. For many scale-free 
networks, it has been shown that the betweenness cen- 
trality approximately scales as B(k) ~ fc p [281 Efll] . where 
B(k) denotes the average betweenness centrality over all 
the A;-degree nodes. Therefore, in a heterogeneous net- 
work, there exists a few high-betweenness nodes, named 
hub nodes, which are easy to be congested. This is pre- 
cisely the cause of low throughput of scale- free networks. 

To enhance the traffic capability, Zhao et al. proposed 
two traffic models 0], where the delivering capability 
of a node i is assigned as C, = 1 + /3fej (Model I) and 
d = l + 0Bi (Model II), respectively. Here, < (3 < 1 
is a control parameter. As we have mentioned above (see 
Eq. (3)), it is clear that the throughput of the whole 
network will increase if the hub nodes have higher deliv- 
ering capability though the total capability Ci keeps 
unchanged. This work suggests a way to alleviate traffic 
congestions for highly heterogeneous networks: making 
nodes with large betweenness as powerful and efficient 
as possible for processing and transmitting information. 
Particularly, in the model II, the throughput R c is inde- 
pendent of the network topology. 

However, there are two reasons that hinder the ap- 
plication of those models. First, the capability /power 
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distributions in some real networks are homogeneous, al- 
though their degree distributions are heterogeneous |3Cj . 
For example, in the broadcasting networks, the forward- 
ing capacity of each node is limited. Especially, in wire- 
less multihop ad hoc networks, each node usually has 
the same power thus almost the same forwarding capac- 
ity [sj. Second, the structure of real networks evolve 
ceaselessly, thus the degree and betweenness centrality 
of each node vary ever and again. By contrary, one can 
not change the delivering capability of a node freely due 
to the technical limitation. 

For the case that each node has the same delivering 
capability C, Yan et al. proposed a novel routing algo- 
rithm to enhance the network throughput ■ Note 
that, the path with shortest length is not necessarily the 
quickest way, considering the presence of possible traf- 
fic congestion and waiting time along the shortest path. 
Obviously, nodes with larger degree are more likely to 
bear traffic congestion, thus a packet will by average 
spends more waiting time to pass through a high-degree 
node. All too often, bypassing those high-degree nodes, 
a packet may reach its destination quicker than taking 
the shortest path. For any path between nodes i and j 
as P(i — * j) := i = xq, x±, ■ ■ ■ x n -\,x n = j, denote 

n-i 

L{P{i^j):P)=Y. k ^f' W 

i=0 

where f3 is a tunable parameter. The efficient path be- 
tween i and j is corresponding to the route that makes the 
sum L(P(i — » j) : (3) minimum. Obviously, L m i n ((3 = 0) 
recovers the traditionally shortest path length. All the 
information packets follow the efficient paths instead of 
the shortest paths. 

In Fig. 2, we report the simulation results for the crit- 
ical value R c as a function of [3 on BA network with 
the size N — 1225 and N = 1500, which demonstrate 
that the optimal router strategy corresponding to (3 = 1 
and the size of BA network doesn't affect the value of 
optimal (3. In comparison with the traditional routing 
strategy (i.e. (3 — 0), the throughput R c of the whole 
network is greatly improved more then 10 times without 
any increase in algorithmic complexity. By extending the 
concept of betweenness centrality to efficient betweenness 
centrality, that is, using the efficient paths instead of the 
shortest paths in the definition of betweenness central- 
ity, the analytic results can be obtained according to the 
Little's law jlrj I27L l32j| . The analytical results are also 
shown in Fig. 2, which agree very well with the simula- 
tions. In the previous studies, the betweenness centrality 
is always considered as a static topological measure of 
networks under the latent assumption that all the infor- 
mation packets go along the shortest paths from source 
to destination. The work of Yan et al. shows that this 
quantity (efficient betweenness) is determined both by 
the routing algorithm and network topology, thus one 
should pay more attention to the design of routing strate- 
gies. For example, a more intelligent router that can de- 
tour at obstacle performs much better than traditional 
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FIG. 3: The critical R c vs a for BA network with average 
degree (fc) = 10 and of size N = 1000. The delivering capa- 
bility of each node is set as a constant C = 10. The results 
are the average over 10 independent runs. 

router which just waits at obstacle |34j . and a recent 
work demonstrates that the dynamical information can 
be used to design more efficient routing strategy |35| . 

III. TRAFFIC DYNAMICS BASED ON LOCAL 
ROUTING PROTOCOL 

Although the routing protocol using global topological 
information is very efficient, it is not practical for huge- 
size communication networks and the evolving networks 
since the technical limitation of the router. It is because 
the router hardware is hard to be designed to have the 
capability to storage much information or adapt dynami- 
cal information. Therefore, it is also very interesting and 
practical to study the traffic behaviors on scale-free net- 
works based on local information. The simplest network 
traffic model on local protocol is the random-walk pro- 
cess, where each packet is delivered to randomly selected 
one of its neighbors as far as it reaches the destination. 
The random-walk on scale-free networks has been exten- 
sively explored previously |3ri l37l l38l|. however, it is far 
from the real traffic since it can not reproduce the self- 
similar scaling as we will present in the next section. 

Motivated by the previous studies about searching en- 
gine |39l |40j and the global routing strategy |32( on com- 
plex networks, Yin et al. proposed a traffic model using 
preferential selection among local neighbors [4Lj . In this 
model, to navigate packets, each node performs a local 
search among its neighbors. If a packet's destination is 
neighboring, it will be delivered directly to its target, oth- 
erwise, it will be forwarded to a neighbor j of its currently 
located node i according to the preferential probability 



where the sum runs over the neighbors of node i, fej is 
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than the one using fixed routing algorithm. And the 
throughput of networks is very low without the PIA rule 
due to many times of unnecessary visiting along the same 
links by the same packets. 

Another factor that affects the performance of local 
routing strategy is the area of information a router can 
make use of. Based on an artificial directed World-Wide- 
Web model (see some recently proposed theoretical mod- 
els of directed World- Wide- Web (iSE^), Tadic et al. in- 
vestigated a local routing protocol with finite buffer size 
of each router, and found that the next-to-nearest rout- 
ing algorithm can perform much better than the nearest 
routing algorithm [45j . In this model each packet follows 
a random-walk unless its destination appears within the 
current router's searching area, and the next-to-nearest 
routing algorithm means the router can directly deliver a 
packet to its destination if this destination is within two 
steps. 



IV. THE CRITICAL PHENOMENA AND 
SCALING BEHAVIORS OF TRAFFIC 



FIG. 4: (Color online) The traffic rate process for free (red), 
critical (blue) and congested (black) states with different {3. 
All the data are obtained from an artificial traffic model 1321. 



the degree of node i, and a is an adjustable parameter. 
Similar to the models mentioned in the last section, the 
first-in- first-out (FIFO) discipline is applied at the queue 
of each node. Another important rule, named path itera- 
tion avoidance (PIA), is that any edge cannot be visited 
more than twice by the same packet. Set the delivering 
capability of each node as a constant C, the simulation 
results show that the optimal performance of the whole 
system corresponds to a = 1 (see Fig. 3). This opti- 
mal value can also be analytically obtained ^3 ■ Further 
more, if the delivering capability of each node is propor- 
tional to^its degree, the optimal value of a will shift to 

It is worthwhile to emphasize that the behavior of Yin 
et al.'s model 13 similar to that of Yan et al.'s model 
[32|. and the behavior of Wang et al.'s model 01 * s sml ~ 
ilar to that of Zhao et al.'s model [lfj. These resem- 
blances indicate the existence of some common policies 
between the design of global and local routing protocols, 
that is, to bypass the hub nodes or to improve the deliv- 
ering capability of these nodes can sharply enhance the 
throughput of the whole network. 

Note that, each router in the present models 0, 
needs to know all its neighbors' degrees and a packet 
has to remember the links its has visited, which requires 
much intelligence of the system. It may damage the ad- 
vantage of local routing strategy since to implement the 
PIA rule will make this system even more complicated 



Recent empirical studies on communication network 
have found pervasive evidence of some surprising scal- 
ing properties, one example of such discoveries is that 
the traffic rates of both a given link in the Internet and 
a local Ethernet exhibit the self-similarity (or fractal- 
like) scaling, and the multifractal scaling is also found 
over small time scale 0, 0, EE El, • These empiri- 
cal studies describe pertinent statistical characteristics of 
temporal dynamics of measured traffic rate process and 
provide ample evidence that these traces are consistent 
with long-range correlated behavior. Furthermore, the 
observation of a phase transition between the free-flow 
phase and the congested phase in the Internet traffic is 
demonstrated by Takayasu et al. through both the round 
trip time ex per iment |5lL 153 ] and packet flow fluctuation 
analysis |53ll54| . They found that the real traffic exhibits 
the I//-type scaling, however, this I// scaling can only 
be observed near the critical state |5fl l5"3 . l53l l54| . 

Cai et al. investigated the scaling behavior of 

mimic traffic rate process near at the critical point gen- 
erated by an the model of Yan et al. [32, Hij. Fig. 4 
reports the average number of packets over all the nodes, 
W(t) — W(t)/N, as a time series in free, critical and 
congested states, respectively. The behaviors of W(t) in 
the free and congested states are very simple: In the for- 
mer case, it fluctuates slightly around a very low value, 
while in the latter case, it increases linearly. However, the 
time series at the critical point is very complicated, which 
exhibits some large fluctuations like those have been ob- 
served in the real traffic 56] . This reason resulting in this 
phenomenon may be the usage of global routing strategy 
that leads to a possible long-range correlation, since this 
phenomenon can not be detected from the random-walk 
model [3(| and the model based on local protocol [4lll4^ |. 
However, a very similar phenomenon is also observed in 
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(4) For a given box size t, we calculate the root-mean- 
square fluctuation 



time scale log t 



FIG. 5: (Color online) The detrended fluctuation analysis 
of the traffic rate processes generated by an artificial traffic 
model |3^ |. All the data are obtained from the critical state, 
and the different curves represent the cases of different /3 from 
to 1.9 at step 0.1. 



a traffic model with local protocol [5?J, where the buffer 
size of each router is finite. 

In the previous studies, the autocorrelation function 
and power spectrum are widely used to analyse the self- 
similar scaling behaviors of both real [5^, |5j| and arti- 
ficial [53 traffic data. However, it is shown that all the 
above methods do not work very well for the effect of 
non-stationary [6(j, and are less accurate than the de- 
trended fluctuation analysis (DFA) proposed by Peng et 
al. |6lLl62| . which has now been accepted as an important 
time series analysis approach and widely used especially 
for financial and biological data |63, |6J, |65|, |6^, |62| • 

The DFA method is based on the idea that a correlated 
time series can be mapped to a self-similar process by an 
integration. Therefore, measuring the self-similar feature 
can indirectly tell us the information about the correla- 
tion properties. Briefly, the description of the DFA algo- 
rithm involves the following steps. 

(1) Consider a time series Xi, i = 1, • • • ,N, where N is 
the length of this series. Determine the profile 



y(i) 



where 



k=l 



(x) 



1 N 
-Y 



,N, 



(6) 



(7) 



(2) Divide profile y(i) into non-overlapping boxes of 
equal size t. 

(3) Calculate the local trend y f it in each box of size 
t by a least-square fit of the series, and the detrended 
fluctuation function is given as 



F(t) 



\ 



1 N 



(9) 



Y(k) = y(k)-y fit (k). 



(8) 



and repeat the above computation for different box sizes 
t (i.e. different scales) to provide a relationship between 
F and t. If the curve F(t) in a log-log plot displays 
a straight line, then F(t) obeys the power-law form t H 
with H the slope of this line. 

As shown in Fig. 5, the traffic rate time series gen- 
erated by the model of Yan et al. [12 also exhibits the 
self-similar scaling behaviors at the critical point of phase 
transition. The scaling exponents calculated with DFA 
for different (3 are approximate H w 1, and the value 
of (3 has almost no effect on H. This value of H im- 
plies the 1 / /-type scaling in the power spectrum and the 
long-range correlated behavior in a wide range of scales. 
A very recent empirical study on the traffic rate process 
of a University Ethernet has demonstrated that the real 
Ethernet traffic displays a self-similarity behavior with 
scaling exponent w 0.98 68], which agrees well with the 
present result H ~ 1. 



V. CONCLUSION AND DISCUSSION 

The studies of network traffic are now in the ascendant. 
Many new discoveries, especially the role of network het- 
erogeneity that can sharply reduce the traffic capacity, 
provide us with new scenarios and problems in under- 
standing and controlling traffic congestions. Physicists 
being not only the new object named scale-free networks, 
but also the new methodologies much different from those 
usually used in the engineering science. As an end of this 
brief review, we list a few interesting problems still un- 
solved below. 

Problem 1: The visual field of router may be one of 
the most important factors that affects the traffic capac- 
ity of whole networks. In the random walk 36] the router 
knows nothing about the top olog ical information; in the 
preferential routing strategy J4y, the router knows the 
topological information of all its nearest neighbors; in 
the global routing protocol 0, |32, each router knows 
the whole topological information. Clearly, with wider 
visual field, the system can perform better. Tadic and 
Rodgers [45[ proposed a local traffic model where each 
router knows the topological information of all its near- 
est and next-nearest neighbors, which, as expected, has 
obviously higher throughput than the case where only 
the nearest neighbors' information is available. Up to 
now, it lacks a systemic study on the effect of router's vi- 
sual field on the traffic condition of networks, which may 
worth some further work. 

Problem 2: A router can memorize huge information 
about shortest or efficient paths that, at least, can be 
used to implement the strategy of fixed routing table is 
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very expensive. Even worth, the current technology does 
not support the router to do dynamical path-finding in 
large-size networks. So, a relative problem to the preced- 
ing one is that what will happen if one mixes the global 
and local protocols together, that is, a few routers in 
the networks can do global path-finding or have mem- 
orized the shortest/efficient paths and others can only 
perform local protocol. A further question is that if the 
addition of a few very powerful routers to a traffic sys- 
tem based on local protocol can sharply enhance the net- 
work throughput, which locations should these powerful 
routers choose? 

Problem 3: Although = 1 corresponds to the optimal 
value of network throughput when using efficient-path 
finding strategy |22j > it is really a bad strategy when the 
traffic density is sparse since to bypass the hub nodes will 
increase the average distance between source and desti- 
nation. If the traffic density is sparse, the strategy with 
(3 > will waste time. Therefore, a natural question 
raises: How to use the dynamical information to improve 
the performance of network traffic? Can we design some 
on-line algorithms to guide the information packets? 

Problem ^: On one hand, in the network traffic dynam- 
ics, the maximal betweenness centrality -B max is the key 
factor that determines the network throughput R c since 
the node having maximal betweeness centrality is the 
bottleneck of information traffic thus is firstly congested. 
On the other hand, the node having maximal betweeness 
centrality is also the bottleneck that hinders the syn- 
chronization signal's communication, thus the network 
of higher -B max may of poorer synchroniz ability [69l Fof . 
Therefore, we guess there may exist some common fea- 
tures between network traffic and network synchroniza- 
tion, although they seem completely irrelevant. Actually, 
some recently proposed methods used to enhance the net- 
work synchronizability can also be used to enhance the 
network throughput [Hi [fl EH Iz3 ■ An in-depth inves- 
tigation is of great theoretical interest and we want to 
know if those two different kinds of dynamics, traffic and 
synchronization, belonging to some kind of "universality 



class" . 

Problem 5: The routing strategies for real Internet |75| 
is of special interest for its practical significance. How- 
ever, the real Internet is highly clustered and displaying 
hierarchical structure |5|, thus far from the extensively 
studied BA networks. Although there exists some highly- 
clustered models with hierarchical structures [zl, lZ3' 
they can not capture the detailed topological properties 
of real Internet. We have noticed that a recent model jT^ 
aiming at Internet is very close to the reality, thus it is 
interesting to explore the difference between simulation 
results based on BA networks and the model of Zhou and 
Mondragon pf . 

Problem 6: The previous studies mainly focus on the 
information flow and corresponding routing strategies. 
However, in the urban traffic, it is not the routes but the 
drivers are intelligent. How can they make use of traf- 
fic information to shorten their travelling time |79j| . and 
whether the selfish of each agent will reduce the system 
profit |3? 
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